Periscope/SQ: Interactive Exploration of Biological Sequence Databases

نویسندگان

  • Sandeep Tata
  • Willis Lang
  • Jignesh M. Patel
چکیده

Life science laboratories today have to rely on procedural techniques to store and manage large sequence datasets. Procedural techniques are cumbersome to use and are often very inefficient compared to optimized declarative techniques. We have designed and implemented a system called Periscope/SQ that makes it possible to rapidly express complex queries within a declarative framework and take advantage of database-style query optimization. As a result, queries in Periscope/SQ run orders of magnitude faster than typical procedural implementations. We demonstrate the power of Persicope/SQ through an application called GeneLocator which allows biologists to rapidly explore large genomic sequence databases.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Declarative Querying for Biological Sequence Databases

The ongoing revolution in life sciences research is producing vast amounts of genetic and proteomic sequence data. Scientists want to pose increasingly complex queries on this data, but current methods for querying biological sequences are primitive and largely procedural. This limits the ease with which complex queries can be posed, and often results in very inefficient query plans. There is a...

متن کامل

Similarity Searching of Secondary Structures in Protein Sequences

The progress of scientific research in the life sciences community is intimately connected to the progress in the database community in that analyzing the mountains of genetic and biological information available involves being able to maintain and query gene and protein databases. In spite of the many decades of progress in database research, surprisingly scientists in the life sciences commun...

متن کامل

Motif Explorer - a Tool for Interactive Exploration of Aminoacid Sequence Motifs

Short amino acid sequence patterns, called motifs, play an important role in molecular biology research. While a number of tools for locating motifs in sequence databases have been developed, no existing tool performs fast enough to allow interactive searching of entire databases. Interactive searching enables biologists to explore the effects of changes to a motif on the set of sequences match...

متن کامل

SRS browser: a visual interface to the sequence retrieval system

This paper presents a novel approach to the visual exploration and navigation of complex association networks of biological data sets, e.g., published papers, gene or protein information. The generic approach was implemented in the SRS Browser as an alternative visual interface to the highly used Sequence Retrieval System (SRS) [1]. SRS supports keyword-based search of about 400 biomedical data...

متن کامل

Protein Databases

Proteins are sources of many peptides with diverse biological activity. Some of them are considered as valuable components of foods and drug targets with desired and designed biological activity. We are now entering an era rich in biological data in which the field of bioinformatics is poised to exploit this information in increasingly powerful ways. There are currently many databases all over ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007